321 research outputs found
Preceding rule induction with instance reduction methods
A new prepruning technique for rule induction is presented which applies instance reduction before rule induction. An empirical evaluation records the predictive accuracy and size of rule-sets generated from 24 datasets from the UCI Machine Learning Repository. Three instance reduction algorithms (Edited Nearest Neighbour, AllKnn and DROP5) are compared. Each one is used to reduce the size of the training set, prior to inducing a set of rules using Clark and Boswell's modification of CN2. A hybrid instance reduction algorithm (comprised of AllKnn and DROP5) is also tested. For most of the datasets, pruning the training set using ENN, AllKnn or the hybrid significantly reduces the number of rules generated by CN2, without adversely affecting the predictive performance. The hybrid achieves the highest average predictive accuracy
On Optimizing Locally Linear Nearest Neighbour Reconstructions Using Prototype Reduction Schemes
This paper concerns the use of Prototype Reduction Schemes (PRS) to optimize the computations involved in typical k-Nearest Neighbor (k-NN) rules. These rules have been successfully used for decades in statistical Pattern Recognition (PR) applications, and have numerous applications because of their known error bounds. For a given data point of unknown identity, the k-NN possesses the phenomenon that it combines the information about the samples from a priori target classes (values) of selected neighbors to, for example, predict the target class of the tested sample. Recently, an implementation of the k-NN, named as the Locally Linear Reconstruction (LLR) [11], has been proposed. The salient feature of the latter is that by invoking a quadratic optimization process, it is capable of systematically setting model parameters, such as the number of neighbors (specified by the parameter, k) and the weights. However, the LLR takes more time than other conventional methods when it has to be applied to classification tasks. To overcome this problem, we propose a strategy of using a PRS to efficiently compute the optimization problem. In this paper, we demonstrate, first of all, that by completely discarding the points not included by the PRS, we can obtain a reduced set of sample points, using which, in turn, the quadratic optimization problem can be computed far more expediently. The values of the corresponding indices are comparable to those obtained with the original training set (i.e., the one which considers all the data points) even though the computations required to obtain the prototypes and the corresponding classification accuracies are noticeably less. The proposed method has been tested on artificial and real-life data sets, and the results obtained are very promising, and has potential in PR applications
Experimental Study of the Shortest Reset Word of Random Automata
In this paper we describe an approach to finding the shortest reset word of a
finite synchronizing automaton by using a SAT solver. We use this approach to
perform an experimental study of the length of the shortest reset word of a
finite synchronizing automaton. The largest automata we considered had 100
states. The results of the experiments allow us to formulate a hypothesis that
the length of the shortest reset word of a random finite automaton with
states and 2 input letters with high probability is sublinear with respect to
and can be estimated as $1.95 n^{0.55}.
Two-proton correlations from 158 AGeV Pb+Pb central collisions
The two-proton correlation function at midrapidity from Pb+Pb central
collisions at 158 AGeV has been measured by the NA49 experiment. The results
are compared to model predictions from static thermal Gaussian proton source
distributions and transport models RQMD and VENUS. An effective proton source
size is determined by minimizing CHI-square/ndf between the correlation
functions of the data and those calculated for the Gaussian sources, yielding
3.85 +-0.15(stat.) +0.60-0.25(syst.) fm. Both the RQMD and the VENUS model are
consistent with the data within the error in the correlation peak region.Comment: RevTeX style, 6 pages, 4 figures, 1 table. More discussion are added
about the structure on the tail of the correlation function. The systematic
error is revised. To appear in Phys. Lett.
Event-by-event fluctuations of average transverse momentum in central Pb+Pb collisions at 158 GeV per nucleon
We present first data on event-by-event fluctuations in the average
transverse momentum of charged particles produced in Pb+Pb collisions at the
CERN SPS. This measurement provides previously unavailable information allowing
sensitive tests of microscopic and thermodynamic collision models and to search
for fluctuations expected to occur in the vicinity of the predicted QCD phase
transition. We find that the observed variance of the event-by-event average
transverse momentum is consistent with independent particle production modified
by the known two-particle correlations due to quantum statistics and final
state interactions and folded with the resolution of the NA49 apparatus. For
two specific models of non-statistical fluctuations in transverse momentum
limits are derived in terms of fluctuation amplitude. We show that a
significant part of the parameter space for a model of isospin fluctuations
predicted as a consequence of chiral symmetry restoration in a non-equilibrium
scenario is excluded by our measurement.Comment: 6 pages, 2 figures, submitted to Phys. Lett.
Single Spin Asymmetry in Polarized Proton-Proton Elastic Scattering at GeV
We report a high precision measurement of the transverse single spin
asymmetry at the center of mass energy GeV in elastic
proton-proton scattering by the STAR experiment at RHIC. The was measured
in the four-momentum transfer squared range \GeVcSq, the region of a significant interference between the
electromagnetic and hadronic scattering amplitudes. The measured values of
and its -dependence are consistent with a vanishing hadronic spin-flip
amplitude, thus providing strong constraints on the ratio of the single
spin-flip to the non-flip amplitudes. Since the hadronic amplitude is dominated
by the Pomeron amplitude at this , we conclude that this measurement
addresses the question about the presence of a hadronic spin flip due to the
Pomeron exchange in polarized proton-proton elastic scattering.Comment: 12 pages, 6 figure
Longitudinal double-spin asymmetry and cross section for inclusive neutral pion production at midrapidity in polarized proton collisions at sqrt(s) = 200 GeV
We report a measurement of the longitudinal double-spin asymmetry A_LL and
the differential cross section for inclusive Pi0 production at midrapidity in
polarized proton collisions at sqrt(s) = 200 GeV. The cross section was
measured over a transverse momentum range of 1 < p_T < 17 GeV/c and found to be
in good agreement with a next-to-leading order perturbative QCD calculation.
The longitudinal double-spin asymmetry was measured in the range of 3.7 < p_T <
11 GeV/c and excludes a maximal positive gluon polarization in the proton. The
mean transverse momentum fraction of Pi0's in their parent jets was found to be
around 0.7 for electromagnetically triggered events.Comment: 6 pages, 3 figures, submitted to Phys. Rev. D (RC
Azimuthal anisotropy and correlations in p+p, d+Au and Au+Au collisions at 200 GeV
We present the first measurement of directed flow () at RHIC. is
found to be consistent with zero at pseudorapidities from -1.2 to 1.2,
then rises to the level of a couple of percent over the range . The latter observation is similar to data from NA49 if the SPS rapidities
are shifted by the difference in beam rapidity between RHIC and SPS.
Back-to-back jets emitted out-of-plane are found to be suppressed more if
compared to those emitted in-plane, which is consistent with {\it jet
quenching}. Using the scalar product method, we systematically compared
azimuthal correlations from p+p, d+Au and Au+Au collisions. Flow and non-flow
from these three different collision systems are discussed.Comment: Quark Matter 2004 proceeding, 4 pages, 3 figure
Measurement of the parity-violating longitudinal single-spin asymmetry for boson production in polarized proton-proton collisions at GeV
We report the first measurement of the parity violating single-spin
asymmetries for midrapidity decay positrons and electrons from and
boson production in longitudinally polarized proton-proton collisions
at GeV by the STAR experiment at RHIC. The measured asymmetries,
and , are consistent with theory
predictions, which are large and of opposite sign. These predictions are based
on polarized quark and antiquark distribution functions constrained by
polarized DIS measurements.Comment: 6 pages, 4 figures, submitted to Physics Review Letter
Azimuthal anisotropy: the higher harmonics
We report the first observations of the fourth harmonic (v_4) in the
azimuthal distribution of particles at RHIC. The measurement was done taking
advantage of the large elliptic flow generated at RHIC. The integrated v_4 is
about a factor of 10 smaller than v_2. For the sixth (v_6) and eighth (v_8)
harmonics upper limits on the magnitudes are reported.Comment: 4 pages, 6 figures, contribution to the Quark Matter 2004 proceeding
- …